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ABSTRACT 


The  state  estimation  problem  for  a certain  class  of  nonlinear 
stochastic  systems  with  white  Gaussian  plant  and  observation  noise  is 
considered.  The  optimal  (minimum  variance)  estimators  for  these  systems 
are  recursive  and  finite  dimensional.  A particular  nonlinear  system 
which  contains  a polynomial  nonlinearity  is  presented.  Both  optimal  and 
suboptimal  estimators  and  an  estimation  lower  bound  for  such  a system 
are  derived.  The  performance  of  the  optimal  and  suboptimal  estimators 
and  the  lower  bound  are  compared  both  analytically  and  by  computer 
simulation. 
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I.  Introduction 

Optimal  estimators  have  been  derived  for  very  general  classes 
nonlinear  systems  [ 1 ] » [2] . However,  the  optimal  estimator  requires, 
in  general,  an  infinite  dimensional  computation  to  generate  the 
conditional  mean  of  the  system  state  given  the  past  observations.  This 
computation  involves  either  the  solution  of  a stochastic  partial 
differential  equation  for  the  conditional  density  or  an  infinite 
dimensional  system  of  coupled  ordinary  stochastic  differential  equations 
for  the  conditional  moments.  For  linear  stochastic  systems  with  linear 
observations  and  white  Gaussian  plant  and  observation  noises,  it  is  known 
that  the  optimal  conditional  mean  estimator  consists  of  a finite 
dimensional  linear  system  (the  Kalman-Bucy  filter  [3]).  Many  types  of 
finite  dimensional  suboptimal  estimators  for  nonlinear  systems  have  been 
proposed  [4],  and  many  numerical  experiments  have  been  performed  to 
compare  the  various  suboptimal  estimators.  In  addition,  upper  and  lower 
bounds  on  the  performance  (error  covariance)  of  optimal  and  suboptimal 
estimators  have  been  derived  for  certain  classes  of  nonlinear  systems 
[5]-[10] . However,  throughout  most  of  the  previous  research,  the 
question  of  how  good  the  performance  of  a suboptimal  estimator  (or  a 
lower  bound)  Is,  as  compared  to  the  performance  of  the  optimal  estimator 
(not  as  compared  to  that  of  other  suboptimal  estimators),  has  remained 
unexamined. 

Recently,  Marcus,  Willsky,  and  Lo  ( 11] -[ 15]  have  derived  finite 
dimensional  optimal  nonlinear  estimators  for  certain  classes  of  nonlinear 
systems  with  white  Gaussian  plant  and  observation  noise.  The  existence 
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of  such  optimal  estimators  allows  us  to  investigate  how  both  optimal  and 
suboptimal  estimators  process  information,  and  to  examine  the  differences 
between  them.  The  classes  of  systems  for  which  finite  dimensional 
estimators  are  implementable  are  described  by  certain  polynomial 
nonlinearities  or  by  certain  classes  of  Volterra  series  expansions.  One 
class  of  models  considered  in  [11]-[12]  is  described  by  the  Ito 
equations 


dx(t)  = F(t)x(t)dt  + G(t)dw(t) 

(1) 

N 

y(t)  = (A0  + l x . ( t) A . )y( t) 
u 1=1  1 1 

(2) 

dz(t)  = H(t)x(t)dt  + dv(t) 

(3) 

where  x(t)  is  an  n-vector,  y(t)  is  a k-vector  or  kxk  matrix,  {A,.}  are 
kxk  matrices,  w and  v are  independent  Brownian  motion  (Wiener)  processes, 
and  x(0)  is  Gaussian.  The  optimal  estimate,  with  respect  to  a wide 
variety  of  criteria,  of  x(t)  given  the  observations  z = (z(s),  0£S£t), 
is  the  conditional  mean  x(t|t)  (also  denoted  by  Et[x(t))  or  E[x(t) J z1] ) 

[4).  It  is  well  known  [3], [4]  that  the  computation  of  x(t|t)  can  be 
performed  by  the  finite  dimensional  (linear)  Kalman-Bucy  filter.  However, 
the  computation  of  y(t|t)  requires  in  general  an  Infinite  dimensional 
system  of  stochastic  differential  equations.  In  [12],  Marcus  and  Wlllsky 
have  proved  that  y(t|t)  Is  computable  with  a recursive  finite  dimensional 
estimator.  If  the  Ideal  generated  by  AQ  in  the  Lie  algebra  {Aq.Aj,. . . .A^)^ 
Is  nllpotent.  Using  Volterra  series  expansions  116]- [17] , Marcus  and 
Wlllsky  have  also  proved  a similar  result  for  systems  described  by  (1),(3), 
and  in  which  y(t)  Is  given  by  a certain  type  of  Volterra  series  expansion 
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or  by  polynomial  type  feed-forward  nonlinearities  [11]-[12],  as 
Illustrated  by  the  following  system: 


[ I 


i 


dxj(t) 

dx,(t) 


-a 


0 

o -b 


Xj(t) 

x2(t) 


dt  + q' 


dWj(t) 

dw,(t) 


dy(t)  = (-y  y(t) + Xj(t)x2(t))dt 


dZj(t) 

Xj(t) 

dt  + r** 

dVj(t) 

dz2(t) 

x2(t) 

dv2(t) 

= F x(t)dt  + q*5dw(t)  (4) 


(5) 


(6) 


where  a,  3,  y>0,  Wj,  Wg»  v^,  and  v2  are  independent,  zero  mean,  unit 
variance  Wiener  processes,  and  x^(0),  x2(0),  and  y(0)  are  independent 
Gaussian  random  variables  which  are  also  independent  of  the  noise 
processes.  Here  (5)  Involves  a polynomial  feed-forward  nonlinearity. 

In  section  II  we  construct,  for  the  system  (4)-(6),  the  optimal 
estimator  and  three  suboptimal  estimators:  the  extended  Kalman  filter 
(EKF),  the  constant  gain  extended  Kalman  filter  (CGEKF),  and  the  best 
linear  estimator  (BLE,  also  known  as  the  linear  minimum-variance 
estimator).  Section  III  is  concerned  with  the  derivations  of  error 
covariance  propagation  equations  for  both  optimal  and  suboptimal 
estimators  and  a lower  bound  based  on  that  of  Bobrovsky  and  Zakai  [10] 
for  the  system  (4)-(6).  The  existence  of  the  optimal  estimator  allows 
a direct  comparison  of  the  lower  bound,  the  error  covariance  of  the 
optimal  estimator,  and  the  error  covariance  of  suboptimal  estimators  for 
various  parameter  values  and  slgnal-to-nolse  ratios.  Results  of 
Monte-Carlo  simulations  are  presented  In  Section  IV. 


V. 

II.  Optimal  and  Suboptimal  Estimators 


The  optimal  finite  dimensional  estimator  for  the  system  (4)-(6)  Is 
constructed  as  follows  [11]- [12] - First,  the  state  x of  the  linear 
system  (4)  is  augmented  with  the  state  n satisfying 


nj(t) 

a - Y - Pjj(t)  0 

nj(t) 

x2(t) 

= 

+ 

n2(t) 

0 B-Y-P^U) 

n2(t) 

Xj(t) 

n^O)  = n2(0)  = 0 (7) 

where  P^(t),  i = 1,2,  are  the  elements  of  the  (nonrandom)  Kalman-Bucy 
filter  error  covariance  matrix  P(t)  for  the  linear  system  (4), (6)  (P(t) 
is  obviously  diagonal).  The  Kalman-Bucy  filter  for  the  linear  system 
(4), (7),  with  observations  (6),  computes  the  conditional  expectations 
x(t|t)  and  n( t | t) . Finally,  y(t|t)  is  computed  according  to 

dy(t|t)  * [-yy(t|t)  +x1(t|t)x2(t|t)]dt+  ^n'(t|t)P(t)dv(t) 

* (8) 

y(0|0)  = E[y(0)] 

where  the  Innovations  process  v Is  given  by 

dv(t)  * dz(t)  - x(t|t)dt,  (9) 

dz(t)  = [dz1(t),dz2(t)l',  and  the  prime  denotes  matrix  transpose.  Hence, 
the  structure  of  the  nonlinearity  In  (5)  allows  the  computation  of  y(t|t) 
with  a five-state  (x,n,y)  estimator. 

The  existence  of  a nonlinear  time-invariant  steady-state  estimator 
for  this  problem  Is  also  demonstrated  In  [12].  The  assumption  which 
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assures  this  result  is  the  controllability  and  observability  of  the 
original  linear  system  (4), (6),  and  some  additional  derivations  show 
the  existence  of  a steady-state  estimator  for  n and  y. 


The  EKF  [4]  for  the  state  of  the  nonlinear  component  of  the  system 
(4)-( 6)  is  given  by 


dy(t|t)  = -Yy(t|t)  + x1(t|t)x2(t|t)  dt+£  Kj(t) ,K2(t)Jdv(t) 

y(0|0)  = E[y(0)] 

(10) 

where  the  "gain"  terms  satisfy 

KjU)  = [-a-y- FP11(t)]K1(t)  + P11(t)X2(t|t) 

(11) 

K2(t)  = [-B-y- 7P22(t)]K2(t)  + P22(t)i1(t|t) 

(12) 

Kj(0)  » Kg( 0)  = 0,  Xj(t|t)  and  Xg( t | t)  are  generated  by  the  Kalman-Bucy 
filter,  and  dv(t)  Is  given  in  (9).  Notice  that  the  EKF  (10)  and  the 
optimal  estimator  (8)  differ  only  in  their  gain  terms.  The  question  of 
whether  constant  gains  In  (10)  will  suffice  Is  also  studied  (see 
Section  IV),  by  utilizing  the  constant  gain  extended  Kalman  filter  (C6EKF) 
of  Safonov  and  Athans  [18];  however,  their  results  are  not  strictly 
applicable  to  the  system  (4)-(6),  because  polynomial  nonllnearltles 
do  not  satisfy  the  finite  incremental  gain  conditions  of  [18], 

Now  consider  the  best  linear  estimator  (BLE).  Such  an  estimator  can 
be  constructed  by  first  finding  a new  state  equation  which  Is  linear,  with 
additive  white  Gaussian  noise,  and  has  the  same  first  two  moments  (mean 
and  covariance)  as  those  of  the  original  system  (4)-(5);  then  the  Kalman- 
Bucy  filter  for  the  new  linear  state  and  the  observation  (6)  computes 
the  BLE  ([19],  p.  152).  Let  x » [Xj,x2,y]'  be  the  new  state.  We  first 


compute  the  mean  and  covariance  propagation  equations  for  the  original 
state  [x'.y]'.  Let  m^(t)  * E[Xj(t)],  mx^(t)  - E[x2(t)],  and 


my(t)  = E[y(t)].  Then, 

m (t)  = -am  (t)  ; 

X1  X1 

mx^(0)  = E [Xj ( 0) ] 

(13) 

m (t)  = -6m  (t) 

*2  a 2 

■x  (°)  = E[x2(0)] 

(14) 

m (t)  = -ym(t)+m  (t)m  (t); 

J y a i*o 

my(0)  = E[y(0)] 

(15) 

Let  r(t)  be  the  covariance  matrix  of  [x'.y]'.  (r  Is  synmetrlc  and 
rv  =0).  Applying  Ito's  differentiation  rule  [4]  to  the  elements  of 

XjX2 

r(t)  and  taking  expectations  in  the  resulting  differentials,  we  have 


rx  x (t)  = q " 2“rx  x <*> 

xlx  1 Vl 

t x (t)  = q - 20r  (t) 


ry  (0)  = var(x.(0) ) (16) 

XjXj  1 


; rx2x2{0)  " var(x2(0))  <17) 


V*’  = (-on)rXiy(t)-X2(t)rXiXi(t);  rx^y(o)  = o 


(18) 

(19) 

(20) 


t v(t)  = (-en)r„  v(t)  + m (t)r  (t);  r (o)  = o 

fyy(t)  = -2yryy(t) +2E[y(t)Xj(t)x2(t)]  - 2mx^(t)mx^(t)my(t) 

ryy(0)  = var(y(0)) 

The  expectation  of  x^( t)x2( t)y( t)  In  (20)  Is  computed  as  follows. 
E[y(t)Xj(t)x2(t)]  = Ej[e"Yt  y(0)+  ^ e'T^“T^  XjfxJx^xJdxjxjUJx^t) 

* e“Yt  my(0)mx  (t)mx  (t)  + / e‘Y(t"x)  E[Xj( t)Xj(x)] 

• E[x2(t)x2(x)]dx 


$ e"Yt  mu(0)m  (t)my  (t)+Q(t) 


y'~' *i  x2 


(21) 


7 


where  Q(t)  can  be  expressed  in  differential  form 


Q(t)  = -(a+B+Y)Q(t)  + [m^(t)+rXiXi(t)]  [m^(t)  + rx2x2(t)] 

Q( o)  = 0 (22) 

Substituting  (21)  into  (20),  we  have,  if  y^a+B, 

/ ■"*  (t)mx  (t)  ) 

yt)  - -2rryy(t)^|Q(t)-  i—±  [i-e-(v-«-S)t]  | 

= -2yryy(t)+2Q(t)  (23) 

and  Q(t)  = Q(t)-mj;  (O)m^  (0)t  e‘^Y+a+8^t,  if  y = a+B.  The  linear  state 
X1  x2 

equation  for  x is  thus  formulated  to  be 


dx(t)  = Fx(t)dt + G(t)dw(t)  + 


I 


where 


-a  0 0 

1 

q*5  0 0 

F = 

0 -B  0 

. GCt)  - 

0 q*4  0 

i 

o 

o 

1 

0 0 \/2Q(t) 

(24) 


and  w(t)  is  a Brownian  motion  of  dimension  3 with  Q(t)dt * E(dw(t)dw' ( t) > . 
Moreover,  G(t)Q(t)G'(t)  should  satisfy 


6(t)Q(t)G'(t) 


q 

0 


0 

q 


mx  ^t)rx  x 
x2  xlxl 

m (t)r  (t) 

A j *2^2 


n (t)r  (t)  m (t)r  (t)  2Q(t) 

*2  *il  A1  a2a2 


(25) 


Then  the  mean  and  covariance  propagation  equations  for  x(t)  have  the 


■ 

I . 

same  fonns  as  those  given  in  equations  (13) -(19), (23).  Observe  that 
in  (24),  Xj(t)  * Xj(t)  and  x2(t)  = x2(t);  only  the  nonlinear  component 
of  (5)  has  been  altered.  The  BLE  is  the  Kalman-Bucy  filter  for  the 
system  (24)  with  observation  (6). 


III.  Performance  of  Estimators  and  Estimation  Lower  Bound 


In  this  section  we  shall  derive  the  error  covariance  propagation 
equations  associated  with  the  optimal  and  suboptlmal  estimators.  We  are 
Interested  In  computing  the  error  covariance  of  y,  since  the  error 
covariance  of  x Is  readily  implemented  via  the  Kalman-Bucy  filter. 

Though  many  lower  bounds  have  been  proposed  to  facilitate  the  evaluation 
of  suboptimal  estimators,  for  example,  Snyder  and  Rhodes  [5], [6]  for 
Gaussian  state  processes,  Gilman  and  Rhodes  [7], [8]  for  systems  with 
cone-bounded  nonlinearities,  and  Zakai  and  his  colleagues  [9], [10]  for 
fairly  general  problems  in  both  discrete  and  continuous  time,  only 
the  lower  bound  derived  by  Bobrovsky  and  Zakai  [10]  is  applicable  to 
the  system  (4)-(6).  i,We  shall  investigate  the  tightness  of  Bobrovsky 
and  Zakai 's  lower  bound  on  estimator  performance  — that  is,  how  close 
is  the  lower  bound  to  the  covariance  of  the  optimal  estimator? 

The  error  covariance  of  y is  the  expected  value  of  the  conditional 
error  covariance  E[(y(t)  - y(t|t))^|zt].  Taking  the  expectations  in  the 
equation  ([4],  equation  (6.100))  satisfied  by  the  conditional  error 
covariance,  we  obtain  the  error  covariance  propagation  equation 
associated  with  y(t|t): 

i(t)  * -2yE(t)  + 2E  [Et[y(t)x1(t)x2(t)]-Et[y(t)]Et[x1(t)x2(t)]] 

- £ E [rf(t)  + z|(t)] 

E(0)  » var(y(0))  (26) 

where  E^t)  * E[(Xj(t)  - xi(t|t))(y(t)  - y( 1 1 1) ) |zt],  1*1,2.  In  the 
sequel,  we  assume  that  Xj(0)  and  x2(0)  are  independent,  zero-mean. 
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Gaussian  random  variables.  It  is  easy  to  show  that  Ij(t)  = Z2(t)  « 0, 
for  Xj(t)  and  x2(t)  are  independent,  zero-mean,  Gaussian  processes. 

The  other  expectation  in  (26)  is  computed  as  follows: 

Et[y(t)x1(t)x2(t)]  - Et[y(t)j  Et[x1(t)x2(t)] 

= jf  Je1  [x1(s)x2(s)x1(t)x2(t)] 

"Et[x1(s)x2^s)J  Et[x1( t)x2( t)J  Jds 

= jf  e‘Y^t~s^P11(s,t,t)x2(s|t)x2(t|t)  + P22(s,t,t)x1(s  | tJXjCtlt) 
+P11(s,t,t)P22(s,t,t)j  ds 

= P11(t)n1(t|t)x2(t|t)  + P22(t)n2(t|t)x1(t|t) 

+ f e~Y(t-s)  Rj^s.t.tJPggts.t.tJds  (27) 

where  P^s.t.t)  = E[(xi(s)  - xi(s 1 1) )(Xj(t)  - x^(t 1 1) ) | zfc] , i,j«l,2,  are 
the  nonrandom  conditional  cross-covariances  defined  in  [12,  Lemma  2.1]; 
here,  P12(s,t,t)  = P21(s,t,t)  =0.  A crucial  component  in  computing  (27) 
is  the  use  of  [12,  Lemma  B.l],  which  expresses  the  higher  order  moment 
Et[x1(s)x2(s)x1(t)x2(t)]  of  a Gaussian  distribution  in  terms  of  lower 
order  moments.  We  need  only  compute  the  expected  values  of  n1(t|t)x2(t|t) 
and  n2(t|t)Xj(t|t),  since  the  last  integral  in  (27)  is  nonrandom. 

The  state  equation  of  £(t|t)  = [x'(t|t),n,(t|t)] ' is  the  Kalman-Bucy 
filter  for  the  augmented  system  (4), (7)  with  observation  (6).  | 
satisfies 

d£(t|t)  - C(t)  Z(t\t)  +£  D(t)dv(t) 

11 


(28) 


where 


-a  0 0 0 

su(t)  0 

C(t)  « 

o 

r* 

I 

o 

CQ 

1 

O 

. D(t)  = 

0 $22( t) 

0 1 a-Y-qPjJft)  0 

o s23(t) 

1 0 0 6-Y-qP22(t) 

S14(t)  0 

and  S(t)  Is  the  conditional  error  covariance  matrix  of  £(t)  given  zt;  it 
satisfies  the  Riccati  equation,  and  S^ft)  = P^t)  and  S22(t)  = P22^^' 
Since  the  innovations  process  v is  a standard  Brownian  motion  process 

A 

with  unit  variance  [2],  the  covariance  <J>(t)  of  C is  easily  computed 
according  to 

*(t)  - C(t)*(t)  + $(t)C,(t)  + i D(t)D’(t)  . (29) 

E[n1(t|t)x2(t|t)]  and  E[n2(t|t)Xj(t|t)]  are,  respectively,  the  (2,3)  and 
(1,4)  elements  of  4>(t).  Expressing  the  last  integral  in  (27)  in 
differential  form,  we  have 

jf*  e'Y(t_s)  Pn(s,t,t)P22(s,t,t)ds  = Pn(t)P22(t)6(t) 
where  6(t)  satisfies 

6(t)  = l+^a+B-y-qPjJ(t) -qP22(t)j6(t);  6(0)  = 0 (30) 

Finally,  equation  (26)  is  rewritten  as 

Z(t)  = -2YE(t)  + 2{p11(t)*23(t)  + P22(t)*14(t)+P11(t)P22(t)6(t)} 

* -2yZ(t)+20(t)  (31) 

where  e(t)  ^0  for  all  t^O,  Is  a function  depending  on  the  system 
parameters  (a.B.y)  and  noise  covariances  (q  and  r).  As  shown  in  [12], 
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the  augmented  linear  system  (4), (7)  is  time-invariant  in  steady  state 
and  S(t)  has  a unique  positive-definite  steady-state  solution  S. 
Similarly,  equations  (29), (30)  have  steady-state  solutions  4>  and  6, 
respectively,  where  4>  is  positive  definite  and  6 is  positive.  Hence, 
at  steady  state,  the  solution  of  (31)  is  1 = e/y. 

The  lower  bound  [10]  for  the  system  (4) -( 6)  is  obtained  by  applying 
the  Kalman-Bucy  filter  to  the  following  linearized  system 


dh(t)  = A(t)h(t)dt + G(t)dw(t) 

(33) 

dk(t)  = B(t)h(t)dt + r*  dv(t) 

(34) 

where  h(t)  and  k(t)  are  3-  and  2-dimensional  vectors,  respectively,  and 
w(t)  and  v(t)  are  independent  standard  Brownian  motions  of  dimensions  3 
and  2,  respectively.  A(t),  B(t),  G(t)  are  3><3,  2x3,  3x3  matrices, 
respectively: 


-a  0 0 

q 0 0 

A(t)  = 

0 

1 

XX> 

O 

ii 

eEf 

•k 

0 q 0 

0 0 -Y 

0 0 0 

and  B(t)  is  the  solution  of 


0 

0 

0 


h(0)  is  assumed  to  be  a zero-mean,  Gaussian  random  vector  with  nonsingular 
variance,  and  k(0)  = 0. 

The  lower  bound  ELB(t)  of  Bobrovsky  and  Zakai,  which  lower  bounds 
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the  error  covariance  of  any  estimator  of  y,  is  the  (3,3)  element  of  the 
error  covariance  matrix  for  the  system  (33)-( 34) . ZLB(t)  satisfies 

ELB(t)  = -2YILB(t)  (35) 


Comparing  (31)  and  (35),  we  have  Z^B(t)  < Z(t)  for  all  t,  as  expected. 

The  bound  does  not  depend  on  the  measurement  noise  covariance  r;  for  some 
values  of  r it  is  far  from  the  optimal  error  covariance  (see  Section  IV), 
since  e(t)  in  (31)  is  nonzero  and  can  in  fact  be  large.  This  demonstrates 
that  the  lower  bound  of  Bobrovsky  and  Zakai  can  be  quite  loose  in  some 
cases.  This  was  to  be  expected,  since  the  bound  is  based  upon  the 
Cramer-Rao  lower  bound,  which  can  be  loose  if  the  signal -to-noise  ratio 
is  small  [20].  However,  in  steady-state  Z(t)  -*■  ZLg(t)  as  r-*-0,  which  can 
be  shown  as  follows.  Based  on  the  discussions  following  (31),  the  steady 
state  solution  of  Z(t)  is 


E = Y = Y {Pll*23  + P22*14  + P11P22  6 } 

where 

P11  = -ar+/x2r2  + qr  , = -fcsr  + ^32r2  + qr  , 

s , P11P22 s . P11P22 

14  (PllP22/r*  + (°1"&+T')p22  + q 23  (pup22/r^  + ^_a+8+Y^Pll + q ’ 
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(PllP22/r)+2g(PllP22/r)S23 
2B[(-a+B+Y)Pn  + ql 


» *14 


(pHP22/r)  + 2ot(pnP22^r^S14 

2a[(a-B+Y)P22  + ql 


and 


-o-e+Y+q(l/Pn  + l/P22)  ‘ 


We  can  easily  see  that  as  r-*0,  I -*•  z^g(=0);  thus,  at  least  at  steady 
state,  this  result  confirms  the  prediction  by  Bobrovsky  and  Zakal  [10]. 

The  EKF  generates  an  "approximation"  E(t)  to  the  conditional  error 
covariance  E[(y(t) -y(t|t))2|zt],  which  Is  given  by 

Z(t)  = -2yZ(t)  + 2K1(t)x2(t|t)  + 2K2(t)x1(t|t)  - CKj(t)]2  - [K2(t)]2  (36) 

Taking  expectations  In  (36),  we  find  that  the  EKF  approximation  E[Z(t)] 
to  the  error  covariance  E[(y(t)  - y(t|t)  ] satisfies 

^ E[Z(t)]  = -2yE[Z(t)]  (37) 

Thus  Z^g(t)  = E[Z(t) ] < I(t),  indicating  that  E[Z(t)]  may  not  be  a very 
good  approximation  to  the  error  covariance  E[(y(t)  - y( 1 1 1) ) ] of  the  EKF; 
for,  since  y(t|t)  of  (8)  is  the  optimal  mean-square  error  estimate,  it  Is 
clear  that  In  fact  E(t)  < E[(y(t)  - y(t|t))2].  Hence,  as  Is  well  known, 
one  should  exercise  caution  in  using  E[E(t)]  as  an  indicator  of  EKF 
performance 

Denoting  E(t)  = E[(y(t)  -y(t|t))2],  the  error  covariance  matrix 
associated  with  y(t|t)  of  the  BLE,  we  have 
z _ . z,(t)  + L(t)  . 

Z(t)  * -2yZ(t)  + 2Q(t)  j— ; 1(0)  * var(y(0))  (38) 

where 

t _ p..(t)L(t) 

2j(t)  « -(o+Yjl^t)  +mx^(t)rx^x^(t)  - — ; Ej(0)  - 0 (39) 

1 _ Poo(t)Z,(t) 

Z2(t)  * -(B+Y)E2(t)+mx  (t)rx  x (t)  - — - --  ; Z2(0)  « 0 (40) 

1 2 2 

and  Ei(t)  - E[(x1(t)-x1(t|t))(y(t)-y(t|t))J,  1-1.2. 
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S 


Consider  now  the  stability  of  ( 38) -(40) . First,  as  t-*-®,  m (t)  and 

X1 

mv  (t)  approach  zero.  Also,  since  (4), (6)  Is  controllable  and  observable, 
x2 

Pj^t)  and  P22(t)  approach  positive  constant  values.  Thus  [24,  p.  113], 
EjU)  and  E2( t)  approach  zero  as  t-*-®.  Comparing  (38)  and  (23),  we 
find  that  E(t)  has  the  same  performance  as  that  of  r (t)  (the  a priori 
covariance)  as  t+®.  Furthermore,  If  Xj(0)  and  x2(0)  are  zero-mean, 
mv  (t),  mv  (t),  E. ( t) , and  E,(t)  will  be  zero  for  all  t>0.  Thus,  the 

1 b — 

observation  process  z(t)  Is  uncorrelated  with  y(t)  In  this  case,  and 
y(t|t)  is  just  the  a priori  mean  my(t);  that  is,  the  BLE  for  y(t)  does  not 
take  advantage  of  the  observations  during  the  course  of  estimation.  The 
error  covariance  E(t)  of  the  BLE  Is  also  the  a priori  covariance  r^t) 
which,  at  steady  state.  Is 

q2 

r = r = 2 (41^ 

yy  4a3Y(a+e+Y) 

Although  In  this  case  y(t)  Is  uncorrelated  with  z(t).  It  Is  not  Independent 
of  z(t);  hence  the  optimal  estimator  which  computes  the  conditional  mean 
y(t|t)  can  In  fact  perform  quite  well,  as  Is  demonstrated  In  Section  IV. 

It  should  also  be  pointed  out  that  for  the  system  (4)-(6)  the  optimal 
estimator  Is  also  the  best  quadratic  estimator;  that  Is,  It  Is  the  best 
estimator  whose  Input/output  map  can  be  expressed  as  a Vol terra  series  of 
order  2 with  Input  as  the  Innovations  process  v(t)  (see  [21]). 

In  [10]  Bobrovsky  and  Zakal  showed  that  for  a scalar  problem  and  for 
sufficiently  small  values  of  r,  optimal  linear  filtering  yielded  practically 
the  same  filtering  error  as  that  of  the  optimal  nonlinear  filtering.  This 
Is  not  the  case  for  our  multivariable  problem.  In  fact,  for  sufficiently 
large  q and  small  r,  E Is  much  greater  than  E and  E^  (see  Section  IV). 
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IV.  Simulation  Results 


In  order  to  compare  and  evaluate  the  performance  of  the  optimal 
estimator,  the  EKF,  the  CGEKF,  the  BLE,  and  the  lower  bound,  digital 
Monte  Carlo  simulations  were  employed.  In  this  section  It  Is  assumed  that 
Xj(0)  and  x(0)  have  zero  mean.  Since  the  error  covariance  propagation 
equations  for  the  optimal  estimator  (31),  the  BLE  (38),  and  the  lower 
bound  (35)  are  ordinary  differential  equations,  they  were  computed  off-line 
and  stored.  Simulations  were  conducted  on  the  EKF  (10),  the  CGEKF,  and 
the  optimal  estimator  (8).  Identical  noise  sequences  were  used  to  allow 
direct  comparison.  Our  approach  to  the  statistical  analysis  of  the 
Monte  Carlo  simulations  parallels  that  of  Bucy  and  his  associates  [22]. 

The  mean-square  error 

y - E[(y(t)-y*(t))2]  (42) 

where  y*(t)  denotes  the  estimate  (e.g.,  y(t|t)  or  y(t|t)),  was  used  as 
the  performance  measure  In  the  simulation.  Suppose  that  {y*>  and  {yn> , 
n*l,...,N,  are  sequences  of  Independent  realizations  of  y*( t)  and  y(t), 
respectively.  Then  the  statistic 

s* » jl(yn-ys)2  m 

Is  an  approximation  to  p for  sufficiently  large  N. 

In  the  experiments,  the  parameters  were  varied  as  follows: 
a,  6.  Y * 0.9 
q * 2 

r • 0.04,  0.2  and  1 
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75  sample  paths,  each  of  which  contained  5000  steps  of  length  0.001s,  were 
run  In  each  simulation.  The  constant  gains  In  the  CGEKF  were  chosen  to  be 
the  average  gain  values  (the  gains  In  (10))  over  75  sample  paths;  both 
gain  values  were  quite  small  (we  also  simulated  the  zero-gain  EKF;  the 
simulation  result  was  almost  the  same  as  that  for  nonzero  gains).  Figs.  1-3 
display,  for  three  cases,  graphs  of  mean-square  error  (averaged  over  75 
sample  paths)  of  the  optimal  estimator,  the  suboptlmal  estimators  (EKF, 

CGEKF,  BLE),  and  the  lower  bound. 

As  expected,  the  steady  state  optimal  error  covariance  approaches  the 

lower  bound  for  very  small  values  of  r (=0.04)  and  Is  greater  than  the 

lower  bound  for  large  r (=1).  The  BLE  remains  unchanged  in  the  three 

figures,  since  It  Is  just  the  a priori  mean.  The  EKF  performance  Is  quite 

close  to  the  simulated  optimal  error  covariance  in  every  simulation  run; 

further  simulations  [23]  have  Indicated  that  the  gain  terms  (see  (8)  and 

(10))  of  the  optimal  estimator  and  the  EKF  are  almost  equal.  The  performance 

of  the  CGEKF  is  somewhat  less  effective  than  that  of  the  EKF,  but  Is  far 

superior  to  the  BLE  performance.  The  estimate  ?(t|t)  of  the  CGEKF  Is  the 

result  of  passing  the  a posteriori  means  Xj(t|t)  and  x2(t|t)  through  the 

nonlinear  filter  (10)  with  gains  set  to  near-zero  values,  while  the  y(t|t) 

of  the  BLE  results  from  passlnq  the  a priori  means  m..  (t)  and  m..  (t)  through 

X1  x2 

(15).  This  of  course  explains  why  the  CGEKF,  which  utilizes  the  observations, 
has  better  performance  than  the  BLE.  It  should  be  noted  that  the  difference 
between  the  averaged  mean-square  error  of  the  optimal  estimator  and  the 
actual  optimal  error  covariance  Is  due  to  the  fact  that  averaging  over  75 
sample  paths  does  not  give  a good  approximation  to  the  expectation. 
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Figure  1.  Mean-square  error: 


Figure  2.  Mean-square  error: 


Figure  3.  Mean-square  error: 


V.  Concluding  Remarks 


A nonlinear  system  with  a finite  dimensional  optimal  nonlinear 
estimator  has  been  considered.  Based  on  such  a system,  optimal  and 
suboptlmal  estimators  and  an  estimation  lower  bound  have  been  studied. 

The  optimal  estimator  Is  used  as  a criterion  In  evaluating  the  performance 
of  the  suboptimal  estimators  and  the  lower  bound.  Simulation  results 
indicate  that  the  performance  of  the  EKF  Is  as  good  as  that  of  the 
optimal  estimator,  and  the  Bobrovsky  and  Zakal  lower  bound  is  tight  for 
very  high  signal -to-nolse  ratio  (r->-0)  but  is  less  effective  for  large 
values  of  state  and  observation  noises.  As  far  as  suboptimal  filter  design 
Is  concerned,  the  CGEKF  is  probably  preferable,  in  most  of  the  cases 
studied,  to  the  optimal  estimator  and  the  EKF,  due  to  its  simple 
computational  requirements.  In  fact,  a naive  estimator  which  passes 
Xj(t|t)  and  XgUlt)  through  a model  of  the  nonlinear  system  (5)  performs 
almost  as  well. 


References 


1.  H.  J.  Kushner,  "Dynamical  equations  for  optimal  nonlinear  filtering," 

*/.  Differential  Equations , vol . 3,  pp.  179-190,  1967. 

2.  M.  Fujisakl,  G.  Kallianpur,  and  H.  Kunlta,  "Stochastic  differential 

equations  for  the  nonlinear  filtering  problem,"  Osaka  J.  Math., 
vol.  9,  pp.  19-40,  1972. 

3.  R.  E.  Kalman  and  R.  S.  Bucy,  "New  results  In  linear  filtering  and 

prediction  theory,"  J.  Basic  Engr.  (Tran.  ASME ),  vol.  83D, 
pp.  95-108,  1961. 

4.  A.  H.  Jazwlnskl,  Stochastic  Processes  and  Filtering  Theory. 

New  York:  Academic  Press,  1970. 

5.  D.  L.  Snyder  and  I.  B.  Rhodes,  "Filtering  and  control  performance 

bounds  with  Implications  on  asymptotic  separation,"  Automatica , 
vol.  8,  pp.  747-753,  1972. 

6.  I.  B.  Rhodes  and  D.  L.  Snyder,  "Estimation  and  control  performance 

for  space-time  point-process  observations,"  IEEE  Transactions  on 
Automatic  Control,  vol.  AC-22,  pp.  338-346,  June  1977. 

7.  A.  S.  Gilman  and  I.  B.  Rhodes,  "Cone- bounded  nonl Inearltles  and 

mean-square  bounds,"  IEEE  Transactions  on  Automatic  Control , 
vol.  AC-18,  pp.  260-265,  June  1973. 

8.  I.  B.  Rhodes  and  A.  S.  Gilman,  "Cone-bounded  nonlinearities  and 

mean-square  bounds:  Estimation  lower  bounds,"  IEEE  Transactions 
on  Automatic  Control,  vol . AC-20,  pp.  632-642,  October  1975. 

9.  M.  Zakal  and  J.  Zlv,  "Lower  and  upper  bounds  on  the  optimal  filtering 

error  of  certain  diffusion  process,"  IEEE  Transactions  on 
Information  Theory , vol . IT-18,  pp.  325-331,  May  1972. 

10.  B.  Z.  Bobrovsky  and  M.  Zakal,  "A  lower  bound  on  the  estimation  error 

for  certain  diffusion  process,"  IEEE  Transactions  on  Information 
Theory,  vol.  IT-22,  pp.  45-52,  January  1976. 

11.  S.  I.  Marcus,  "Estimation  and  analysis  of  nonlinear  stochastic 

systems,"  Ph.D.  dissertation.  Dept,  of  Electrical  Engineering, 
M.I.T.,  Cambridge,  MA,  June  1975. 

12.  S.  I.  Marcus  and  A.  S.  Will  sky,  "Algebraic  structure  and  finite 

dimensional  nonlinear  estimation,"  SIAM  J.  Math.  Analysis, 
vol.  9,  pp.  312-327,  April  1978. 


23 


13.  A.  S.  Will  sky  and  S.  I.  Marcus,  "Estimation  of  bilinear  stochastic 

systems,"  Variable  Structure  Systems  with  Applications  to 
Economics  and  Biology , R.  R.  Mohler  and  A.  Ruberti,  eds.. 

New  York:  Sprlnger-Verlag,  pp.  116-137,  1975. 

14.  J.  T.  Lo  and  A.  S.  Willsky,  "Estimation  for  rotational  processes  with 

one  degree  of  freedom  I:  Introduction  and  continuous  time 
processes,"  IEEE  Transactions  on  Automatic  Control,  vol . AC-20, 
pp.  10-21,  1975. 

15.  A.  S.  Willsky,  "Some  estimation  problems  on  Lie  groups,"  Geometric 

Methods  in  System  Theory,  R.  W.  Brockett  and  D.  Q.  Mayne,  eds., 
Reldel,  The  Netherlands,  1973. 

16.  R.  W.  Brockett,  "Volterra  series  and  geometric  control  theory," 

Automatica , vol.  12,  pp.  167-176,  1976. 

17.  R.  W.  Brockett,  "Nonlinear  syst  ms  and  differential  geometry," 

Proceedings  of  the  IEEE,  vol.  64,  pp.  61-72,  January  1976. 

18.  M.  G.  Safanov  and  M.  Athans,  "Robustness  and  computational  aspects 

of  nonlinear  stochastic  estimators  and  regulators,"  IEEE 
Transactions  on  Automatic  Control,  vol . AC-23,  pp.  717-725, 

August  1978. 

19.  R.  S.  Liptser  and  A.  N.  Shiryayev,  Statistics  of  Random  Processes  II. 

New  York:  Sprlnger-Verlag,  1978. 

20.  D.  J.  Sakrlson,  Rotes  on  Analog  Cormunication.  New  York: 

Van  Nostrand  Reinhold,  1970. 

21.  S.  I.  Marcus,  S.  K.  Mltter,  and  D.  Ocone,  "Finite  dimensional 

nonlinear  estimation  in  continuous  and  discrete  time,"  presented 
at  the  International  Conference  on  Analysis  and  Optimization  of 
Stochastic  Systems,  Oxford,  September  6-8,  1978. 

22.  R.  S.  Bucy,  C.  Hecht,  and  K.  0.  Senne,  "An  engineer's  guide  to 

building  nonlinear  filters,"  F.  »1.  Seller  Research  Lab., 

Project  7904-00-37,  May  1972. 

23.  C.-H.  Liu,  "A  comparison  of  optimal  and  suboptlmal  estimators  and 

estimation  lower  bounds,"  M.S.  Thesis,  Dept,  of  Electrical 
Engineering,  University  of  Texas  at  Austin,  August  1978. 

24.  J.  L.  Willems,  Stability  Theory  of  Dynamical  Systems.  New  York: 

Wiley,  1970. 


SECURITY  CLASSlFpS>*f5M.pr  THIS  PAGE  fl»hmn  Dele  Entered) 

RIMPT  DOCUMENTATION  PAGE  | before** completwg^orm 

I urrsnt  iiMnirlJTr  i n i n r i r inn  nn  "P^ftrriririif-  r i t i n r 

AFOSR/ira-5^4Llii  4/1  r 97 


1 4.  TITLE  (end  Subtitle) 


Estimator  Performance  for  a Class  of  Nonlinear  / Interim 
Estimation  Problems ' 9 *■■...  : ” 

^ — I 11  Si  Will] 


CONTRACT  OR  GRANT  NUMBER^) 


Oo\  Chang-Huann.iu  ggg  Steven  I./larcus 


19.  PERFORMING  ORGANIZATION  NAME  AND  ADDRESS 


The  University  of  Texas  at  Austin 
Department  of  Electrical  Engineering/ 
Austin,  Texas  78712 


10.  PROGRAM  ELEMENT.  PROJECT.  TASK 
AREA  & WORK  UNIT  UUMAERS 


111.  CONTROLLING  OFFICE  NAME  ANO  ADORESS 


Air  Force  Office  of  Scientific  Research/NFf/*^  Mar* 
Bolling  AFB,  Washington,  D.C.  20332 


4.  MONITORING  AGENCY  NAME  & AODRESSf/f  dll  let opt  from  Controlling  Of  lice)  MS.  SECURITY  CLASS,  (ol  thle  roport) 


Unclassified 


15m.  DECLASSIFICATION/ DOWNGRADING 
SCHEDULE 


1 16.  DISTRIBUTION  STATEMENT  (ol  thla  Report) 


Approved  for  public  release;  distribution  unlimited. 


17.  DISTRIBUTION  STATEMENT  (of  the  ebetrect  entered  In  Block  20,  II  dlllerent  from  Report) 


10.  SUPPLEMENTARY  NOTES 


19.  KEY  WORDS  (Continue  on  rereree  elde  II  neceeemry  end  Identify  by  block  number) 

Nonlinear  Estimation,  Estimation  Lower  Bounds. 


" ABtt’RACT  ( Continue  on  reveree  elde  II  neceeemry  end  Identify  by  block  number) 

The  state  estimation  problem  for  a certain  class  of  nonlinear  stochastic 
systems  with  white  Gaussian  plant  and  observation  noise  is  considered.  The 
optimal  (minimum  variance)  estimators  for  these  systems  are  recursive  and 
finite  dimensional.  A particular  nonlinear  system  which  contains  a polynomial 
nonlinearity  Is  presented.  Both  optimal  and  suboptlmal  estimators  and  an 
estimation  lower  bound  for  such  a system  are  derived.  The  performance  of  the 
optimal  and  suboptlmal  estimators  and  the  lower  bound  are  compared  both 
analytically  and  by  computer  simulation. 


oo  M7J 


9?/ 


\ UNCLASSIFIED 

SECURITY  CLASSIFICATION  OP  THIS  RAO*  <W|«1  Dmtm  tnfrvd) 


. v-'y.v  V- ' 


^ . . 


